TREC-3 Ad-Hoc, Routing Retrieval and Thresholding Experiments using PIRCS

Authors

  • Kui-Lam Kwok
  • Laszlo Grunfeld
  • David D. Lewis
Abstract

The PIRCS retrieval system has been upgraded in TREC-3 to handle the full English collections of 2 GB in an efficient manner. For ad-hoc retrieval, we use recurrent spreading of activation in our network to implement query learning and expansion based on the best-ranked subdocuments of an initial retrieval. We also augment our standard retrieval algorithm with a soft-Boolean component. For routing, we use learning from signal-rich short documents or subdocument segments. For the optional thresholding experiment, we tried two approaches to transforming retrieval status values (RSVs) so that they could be used to partition documents into retrieved and nonretrieved sets. The first method normalizes RSVs using a query self-retrieval score. The second, which requires training data, uses logistic regression to convert RSVs into estimates of probability of relevance. Overall, our results are highly competitive with those of other participants.
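
The two thresholding ideas named in the abstract can be illustrated with a small sketch. This is not the PIRCS implementation: the abstract gives no formulas, so the division by the self-retrieval score, the single-feature logistic model, the gradient-descent fit, and every function name and parameter below are assumptions chosen for illustration only.

```python
import math


def normalize_by_self_retrieval(rsvs, self_score):
    """Assumed normalization: divide each RSV by the score the query
    obtains when it is used to retrieve itself."""
    if self_score <= 0:
        raise ValueError("self-retrieval score must be positive")
    return [r / self_score for r in rsvs]


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def fit_logistic(rsvs, labels, lr=0.1, epochs=2000):
    """Fit P(relevant | rsv) = sigmoid(a + b * rsv) by batch gradient
    descent on the log loss. labels are 1 (relevant) or 0 (not)."""
    a, b = 0.0, 0.0
    n = len(rsvs)
    for _ in range(epochs):
        grad_a = grad_b = 0.0
        for x, y in zip(rsvs, labels):
            err = sigmoid(a + b * x) - y
            grad_a += err
            grad_b += err * x
        a -= lr * grad_a / n
        b -= lr * grad_b / n
    return a, b


def probability_of_relevance(rsv, a, b):
    """Convert an RSV into an estimated probability of relevance."""
    return sigmoid(a + b * rsv)


def partition(scores, threshold):
    """Split document indices into retrieved and nonretrieved sets."""
    retrieved = [i for i, s in enumerate(scores) if s >= threshold]
    nonretrieved = [i for i, s in enumerate(scores) if s < threshold]
    return retrieved, nonretrieved


# Hypothetical training data: normalized RSVs with relevance judgments.
train_rsvs = [0.1, 0.2, 0.3, 0.7, 0.8, 0.9]
train_labels = [0, 0, 1, 0, 1, 1]
a, b = fit_logistic(train_rsvs, train_labels)

# Score new documents, then keep those judged more likely relevant than not.
new_rsvs = normalize_by_self_retrieval([3.6, 0.4, 2.0], self_score=4.0)
probs = [probability_of_relevance(r, a, b) for r in new_rsvs]
retrieved, nonretrieved = partition(probs, threshold=0.5)
```

In this sketch the first method needs no training data, because the query's own score supplies the scale, while the second needs judged documents to fit `a` and `b`. The 0.5 cutoff is an arbitrary illustrative choice, not a value taken from the paper.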

Similar resources

TREC-7 Ad-Hoc, High Precision and Filtering Experiments using PIRCS

In TREC-7, we participated in the main task of automatic ad-hoc retrieval as well as the high precision and filtering tracks. For ad-hoc, three experiments were done with query types of short (title section of a topic), medium (description section) and long (all sections) lengths. We used a sequence of five methods to handle the short and medium length queries. For long queries we employed a re...

TREC-8 Ad-Hoc, Query and Filtering Track Experiments using PIRCS

In TREC-8, we participated in automatic ad-hoc retrieval as well as the query and filtering tracks. The theme of our participation is ‘retrieval lists combination’, and the technique is applied throughout our experiments to varying degrees. We point out that our PIRCS system may be considered a combination of a probabilistic retrieval model and a language model approach. For ad-hoc, three t...

TREC-4 Ad-Hoc, Routing Retrieval and Filtering Experiments using PIRCS

Our ad-hoc submissions are pircs1, which is fully automatic, and pircs2, which involves manually weighting some terms and adding some new words to the original topic descriptions. The number of words added is minimal. Both methods involve training and query expansion using the best-ranked subdocuments from an initial retrieval as feedback. For our routing experiments we make use of massive query...

TREC-5 English and Chinese Retrieval Experiments using PIRCS

Two English automatic ad-hoc runs have been submitted: pircsAAS uses short and pircsAAL employs long topics. Our new avtf*ildf term weighting was used for short queries. Two-stage retrieval was performed. Both automatic runs are much better than the overall automatic average. Two manual runs are based on short topics: pircsAM1 employs double weighting for user-selected query terms and pircsAM2 a...

TREC-6 English and Chinese Retrieval Experiments using PIRCS

For TREC-6 ad-hoc experiments, we continue to use two-stage retrieval with pseudo-feedback from top-ranked unjudged documents for both Chinese and English. We perform three types of retrieval characterized by queries formed using title only, description only and all sections of the given topics. For short queries mainly derived from title or description section, query terms are weighted by avera...

Publication date: 1994